Picture for Wenhan Luo

Wenhan Luo

ROAR-3D: Routing Arbitrary Views for High-Fidelity 3D Generation

Add code
May 20, 2026
Viaarxiv icon

STiTch: Semantic Transition and Transportation in Collaboration for Training-Free Zero-Shot Composed Image Retrieval

Add code
May 20, 2026
Viaarxiv icon

Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence

Add code
May 19, 2026
Viaarxiv icon

Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models

Add code
May 10, 2026
Viaarxiv icon

Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling

Add code
Feb 11, 2026
Viaarxiv icon

UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass

Add code
Jan 03, 2026
Viaarxiv icon

Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models

Add code
Dec 22, 2025
Viaarxiv icon

CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing

Add code
Dec 15, 2025
Viaarxiv icon

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Add code
Dec 11, 2025
Figure 1 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 2 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 3 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 4 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Viaarxiv icon

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Add code
Aug 19, 2025
Figure 1 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 2 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 3 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Figure 4 for InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
Viaarxiv icon